Towards modeling user behavior in interactions mediated through an automated bidirectional speech translation system
نویسندگان
چکیده
This paper addresses modeling user behavior in interactions between two people who do not share a common spoken language and communicate with the aid of an automated bidirectional speech translation system. These interaction settings are complex. The translation machine attempts to bridge the language gap by mediating the verbal communication, noting however that the technology may not be always perfect. In a step toward understanding user behavior in this mediated communication scenario, usability data from doctor–patient dialogs involving a two way English–Persian speech translation system are analyzed. We specifically consider user behavior in light of potential uncertainty in the communication between the interlocutors. We analyze the Retry (Repeat and Rephrase) versus Accept behaviors in the mediated verbal channel and as a result identify three user types – Accommodating, Normal and Picky, and propose a dynamic Bayesian network model of user behavior. To validate the model, we performed offline and online experiments. The experimental results using offline data show that correct user type is clearly identified as a user keeps his/her consistent behavior in a given interaction condition. In the online experiment, agent feedback was presented to users according to the user types. We show high user satisfaction and interaction efficiency in the analysis of user interview, video data, questionnaire and log data. 2009 Elsevier Ltd. All rights reserved.
منابع مشابه
An English-persian Automatic Speech Translator: Recent Developments in Domain Portability and User Modeling
In this paper we describe the English-Persian speech to speech translation device, and provide insight on the lessons learned during the first phase of development of this system. We start by giving an overview of the underlying components of the device: the front end ASR, the machine translation system and the speech generation system. Challenges such as the sparseness of available spoken lang...
متن کاملTowards modeling user behavior in human-machine interactions: Effect of Errors and Emotions
Data-driven approaches to spoken dialog strategy design rely on a sound understanding and modeling of user behavior in their interaction with machines. The spoken language usermachine communication channel is inherently noisy; noise in the channel may be due to errors in machine speech recognition, language understanding or other machine/user communication uncertainty and errors. Hence, annotat...
متن کاملIraqComm and FlexTrans: A Speech Translation System and Flexible Framework
SRI International's IraqComm system performs bidirectional speech-to-speech machine translation between English and Iraqi Arabic in the domains of force protection, municipal and medical services, and training. The system was developed primarily under DARPA's TRANSTAC Program and includes: speech recognition components using SRI's Dynaspeak engine; MT components using SRI's Gemini and SRInte...
متن کاملModeling Emotional State and Personality for Conversational Agents
We describe an architecture for constructing a character-based agent based on speech and graphical interactions. The architecture uses models of emotions and personality encoded as Bayesian networks to 1) diagnose the emotions and personality of the user, and 2) generate appropriate behavior by an automated agent in response to the user’s input. Classes of interaction that are interpreted and/o...
متن کاملInteractive Speech Translation in the Diplomat Project 2 Multi-engine Machine Translation
The DIPLOMAT rapid-deployment speech translation system is intended to allow na ve users to communicate across a language barrier, without strong domain restrictions, despite the error-prone nature of current speech and translation technologies. Achieving this ambitious goal depends in large part on allowing the users to interactively correct recognition and translation errors. We brieey presen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computer Speech & Language
دوره 24 شماره
صفحات -
تاریخ انتشار 2010